PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D13G0240
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family MYB
Protein Properties Length: 1506aa    MW: 164539 Da    PI: 6.225
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D13G0240genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding27.29.3e-09809850346
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e++ d  + +G++ +++Ia  +  ++t  +c+++++k
      Gh_D13G0240 809 PWTSEEKEIFMDKLAAFGKD-FRKIATFLD-HKTTADCVEFYYK 850
                      8*****************99.*********.***********98 PP

2Myb_DNA-binding30.49e-1010281067445
                       S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45  
                       WT eE   +++av  +G++ ++ I+r++  +R++ qck ++ 
      Gh_D13G0240 1028 WTDEEKSVFIQAVSSYGKD-FAMISRCVR-TRSRDQCKVFFS 1067
                       *****************99.*********.********8776 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466899.72E-14793854IPR009057Homeodomain-like
PROSITE profilePS5129314.53805856IPR017884SANT domain
SMARTSM007171.6E-7806854IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.5E-5808855IPR009057Homeodomain-like
PfamPF002497.8E-6808850IPR001005SANT/Myb domain
PROSITE profilePS5129311.17410231074IPR017884SANT domain
SMARTSM007171.0E-710241072IPR001005SANT/Myb domain
SuperFamilySSF466892.89E-1010261074IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.605.1E-610271068IPR009057Homeodomain-like
CDDcd001677.15E-710281066No hitNo description
PfamPF002493.4E-710281067IPR001005SANT/Myb domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1506 aa     Download sequence    Send to blast
MPQEPLPWDR KDIYKDRKHE RAELQPPPLL AARWREASSM SSYQHGSFRE FARWGSADFR  60
RPPGHGKQGN WHLFPEDIGG HGYVPWRSSD KILDGETYRQ SVSRGDGKYG RSYSRDNNRG  120
SYNQRDWRGH SLETSNGSPN TSVRPHDVNN EQRSVDDMFT YPSRTHSDFV NTWNQLQKDQ  180
HDNRTCGVNG LGTGQRCERE NSLGSVDWKP LKWSRSGSLS SWGSGFSHSS SSKSLGGVDS  240
GEAKLELHQK NLAPVQSPSG DAAACVTSAP PSDETTSRKK PRLGWGEGLA KFEKKKDGGP  300
DTSINSGGAA ISLCNTEPNT SLNSNLVDKS PRVLGFSDCS SPATPSSVAC SSSPGVEEKS  360
FGKAANIDND VNNLCGSPSF GSQNQLEGSS FSLEKLDINS IINMGSSLID LLQSDEPSTM  420
DSSFVQSTAI NKLLLWKGDI LKALEMTESE IESLETELKS SKDDPGRRCQ CPATSSSLPV  480
QENGKSCEEQ EAASSMIPQP APLKIDPSND VLEVLQEANA DIKDGVIDSP GTATSDFMLS  540
SSLEKAESLC DVVKVQDCSG NSSSAQLKTM EEVILATDSC NEEAAAVISG EGSVLVKIDN  600
EAHVPESSNS DAGGENMTCD VILTTNKELA NRSSLVFKKL FPEDQYSIEI SEISNAVWGQ  660
ISSLIREKIA MRKRHLRFKE RVLTLKFKAF QYAWKEDMLS PAMRKYWAKS QKKYELSLRS  720
TYGGYQKHRS SFRSRVTSSA GNLVLEPTAE MINFTSKLLL DSRVKLYRNA LKMPALILDE  780
QEQLSRFISS NGLVEDPCAI EKERALINPW TSEEKEIFMD KLAAFGKDFR KIATFLDHKT  840
TADCVEFYYK NHKSECFKKT TKKLDLTKQG KSSANTYLLT SGKKWSKEFN AASIDVLGAA  900
SVIATHAESG MQKHQTSSSR IFFGGRYSKI SRADDRIADR LSSFDIIGNN RETAAADVLA  960
GICGLLSSEA MSSCITSSVD PGESFHRDWK CHKVDSLLKR RSTSNVAQNV DDGTCSDESC  1020
GEMDPADWTD EEKSVFIQAV SSYGKDFAMI SRCVRTRSRD QCKVFFSKAR KCLGLDLIDP  1080
RTRNLGTPMS DDANGGGSDA EDACVLERLV VSSDKLGSKP EDLPSNILCT NMDERNPTSK  1140
PILPTDLNVP DENNRKLVDH RDSEAVQTVD SVAGLAELIS ECSVDMNIDS KAGSLQVQKS  1200
FVALGNLNAG RDVTEQGVSV AVSASLGAAA HPCTPSLDSV AVSEPATSLY ENDTKCSAET  1260
GSQSICRIDL NKASDESVGK NSCSGFSLSA KGLHQIPPDL DSAKKPSVSN NSSANGSALH  1320
DSDGLRCEKI CNLGRLSSTL DYKENEAKQA QKSVREDESG RLSGKTSVNV TEPHRILRGY  1380
PLQVSTLKEM NGDVKCLATS KRGSAGPCLA QECYLQKCNS SKSAAELPLL VENLEQAKDR  1440
PKSHCRISDT ENPGRNGNVK LFGQILNSSS RDDKMEQQAL QTVVYGTDRT LNGVSFPLKG  1500
NKQQQR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C1e-17768858494NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D1e-17768858494NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX5788050.0JX578805.1 Gossypium hirsutum clone NBRI_GE10901 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012454302.10.0PREDICTED: uncharacterized protein LOC105776280 isoform X1
RefseqXP_012454301.10.0PREDICTED: uncharacterized protein LOC105776280 isoform X1
TrEMBLA0A0D2S7670.0A0A0D2S767_G
STRINGVIT_13s0019g04010.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM52602744
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein